Implementation of realtime STRAIGHT speech manipulation system: Report on its first implementation

نویسندگان

  • Hideki Banno
  • Hiroaki Hata
  • Masanori Morise
  • Toru Takahashi
  • Toshio Irino
  • Hideki Kawahara
چکیده

A very high quality speech analysis, modification and synthesis system—STRAIGHT— has now been implemented in C language and operated in realtime. This article first provides a brief summary of STRAIGHT components and then introduces the underlying principles that enabled realtime operation. In STRAIGHT, the built-in extended pitch synchronous analysis, which does not require analysis window alignment, plays an important role in realtime implementation. A detailed description of the processing steps, which are based on the so-called ‘‘just-in-time’’ architecture, is presented. Further, discussions on other issues related to realtime implementation and performance measures are also provided. The software will be available to researchers upon request.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Design and Implementation of an Intelligent Part of Speech Generator

The aim of this paper is to report on an attempt to design and implement an intelligent system capable of generating the correct part of speech for a given sentence while the sentence is totally new to the system and not stored in any database available to the system. It follows the same steps a normal individual does to provide the correct parts of speech using a natural language processor. It...

متن کامل

TANDEM-STRAIGHT, a research tool for L2 study enabling flexible manipulations of prosodic information

A speech analysis, modification, and resynthesis system called STRAIGHT has been widely used in the speech research community. However, its foundation and implementation were not well established. This lecture introduces recent advances in STRAIGHT’s foundation based on a new concept called TANDEM, a simple method for calculating temporally stable power spectra using two F0-adaptive time window...

متن کامل

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

متن کامل

A continuous VQ clustering algorithm for realtime speech recognition

This paper presents a continuous VQ clustering (CVQC) algorithm for realtime speech recognition, which incorporates the temporal information of speech into both training and recognition processes. In comparison with the conventional DTW and VQ methods, this new algorithm delivers faster training and recognition speed and smaller codebook size while still retains merits of both. Realtime impleme...

متن کامل

Implementation of The First Medical science Olympiad in Iran: A report

The first national medical science Olympiad suggested by Isfahan University of Medical Sciences was hold in 2009 in Isfahan. The venture had the mission to identify and flourish potentials in Iranian medical science students - the health system's capital. The ministry of health in collaboration with the affiliated universities hosted 364 medical science students. Students formed teams of three ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007